Automatic Text Localisation in Scanned Comic Books

Authors

  • Christophe Rigaud
  • Dimosthenis Karatzas
  • Joost van de Weijer
  • Jean-Christophe Burie
  • Jean-Marc Ogier
Abstract

Comic books constitute an important cultural heritage asset in many countries. Digitization combined with subsequent document understanding enables direct content-based search, as opposed to metadata-only search (e.g. by album title or author name). Few studies have been done in this direction. In this work we detail a novel approach for automatic text localization in scanned comic book pages, an essential step towards fully automatic comic book understanding. We focus on speech text, as it is semantically important and represents the majority of the text present in comics. The approach is compared with existing text localization methods from the literature, and results are presented.


Similar resources

Box Flame Detection and Image Normalization in Comic Images

In recent years, digital comic images have become popular on the Internet. In general, they are scanned from comic books, but most are not normalized with respect to size, skew and margin. Normalizing comic images in a preprocessing step has many advantages for automatic comic image analysis, such as content-based comic image retrieval systems. The normalization in comic image can be carried o...


Speech balloon contour classification in comics

Comic book digitization combined with subsequent comic book understanding creates a variety of new applications, including mobile reading and data mining. Document understanding in this domain is challenging, as comics are semi-structured documents that combine semantically important graphical and textual parts. In this work we detail a novel approach for classifying speech balloon in scanned comi...


Localisation contextuelle des personnages de bandes dessinées

RÉSUMÉ (translated from French). The authors propose a method for localizing characters in comic book panels based on the characteristics of speech balloons. The evaluation shows a character localization rate of up to 65%. ABSTRACT. The authors present a new method to localize comic characters inside comic book panels, relying on speech balloon properties. The e...


Are Comic Books an Effective Way to Engage Nonmajors in Learning and Appreciating Science?

Comic books employ a complex interplay of text and images that gives them the potential to effectively convey concepts and motivate student engagement. This makes comics an appealing option for educators trying to improve science literacy about pressing societal issues involving science and technology. Here, we report results from the first systematic assessment of how a science comic book can ...


Robust Frame and Text Extraction from Comic Books

Comic books constitute an important heritage in many countries. Nowadays, digitization makes it possible to search content directly rather than metadata only (e.g. album title or author name). Few studies have been done in this direction. Only frame and speech balloon extraction have been tried, and only for simple page structures. In fact, the page structure depends on the author which is why ...




Publication date: 2013